[TensileLite] Add FP8FP8S #304

Merged
KKyang merged 1 commit into ROCm:develop from KKyang:fp8fp8s on Sep 6, 2023

Conversation

@KKyang KKyang commented Sep 6, 2023

No description provided.

@KKyang KKyang force-pushed the fp8fp8s branch 2 times, most recently from 762b4e2 to 35c1f21 Compare September 6, 2023 06:17
@KKyang KKyang marked this pull request as ready for review September 6, 2023 06:17

@jichangjichang jichangjichang left a comment

LGTM

KKyang commented Sep 6, 2023

gfx940 local test passed

@KKyang KKyang merged commit bfe2b47 into ROCm:develop Sep 6, 2023
assistant-librarian Bot pushed a commit that referenced this pull request Jul 10, 2025
Changing sgpr limits (#304)

This includes 2 changes:
- Removed the requirement that the temp sgprs needed for gsu be contiguous, avoiding overflow for certain kernels
- Account for additional temp sgprs that will be required for code gen, up to physical limits